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[Name of Document] DESCRIPTION 

[Title of the Invention] METHOD FOR PREDICTING A DRUG TRANSPORT 
CAPABILITY BY ABCG2 POLYMORPHISMS 
[Claims] 
[Claim 1] 

A method for predicting a drug transport capability of a mammalian cell 
comprising the steps of: 

collecting a sample from a mammal, 

determining a polymorphism of the nucleotide sequence of ABCG2 gene or a 
polymorphism of the amino acid sequence of ABCG2 polypeptide. 
[Claim 2] 

The method of claim 1, wherein said ABCG2 gene comprises a DNA 
consisting of the nucleotide sequence of SEQ ID NO:l, and said polymorphism of the 
nucleotide sequence is one or more of single nucleotide polymorphisms at positions 
selected from the group consisting of 34, 376 and 421 of SEQ ID NO:l. 
[Claim 3] 

The method of claim 2, wherein said single nucleotide polymorphism is 
selected from the group consisting of G34A, C376T and C421A. 
[Claim 4] 

The method of claim 2 or 3, wherein said polymorphism of the nucleotide 
sequence is determined by any one of methods selected from the group consisting of a 
direct sequencing method, TaqMan method, invader method, mass spectrometry, RCA 
method and DNA chip method. 
[Claim 5] 

The method of claim 1, wherein said ABCG2 polypeptide comprises an amino 
acid sequence of SEQ ID NO:2, and said polymorphism of the amino acid sequence is 
one or more of amino acid polymorphisms at positions selected from the group 
consisting of 12, 126, and 141 of SEQ ED NO:2. 
[Claim 6] 
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The method of claim 5, wherein said amino acid polymorphism is an amino 
acid substitution of Vall2Met or GInl41Lys, or a deletion of the amino acid sequence 
downstream from the position 126 of SEQ ID NO:2. 
[Claim 7] 

The method of claim 5 or 6, wherein said polymorphism of the amino acid 
sequence is determined by any one of methods selected from the group consisting of 
mass spectrometry, two-dimensional electrophoresis method, and protein chip method. 
[Claim 8] 

The method of any one of claims 1 to 7, wherein said drug is a compound 

represented by the following general formula (I): 

R 




halogen atom or hydroxy 1 group, R represents a lower alkylamino group (wherein said 
lower alkylamino group may be substituted with a hydrogen atom, amino group, 
formylamino group, one to three hydroxyl group(s), a pyridyl group optionally having 
substituent(s), or thienyl group optionally having substituent(s)), and G represents a 
pentose group or hexose group (said pentose group or hexose group may be substituted 
with an amino group).] 
[Claim 9] 

A polynucleotide having a single nucleotide polymorphism(s) at one or more 
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position(s) selected from the group consisting of 34, 376 and 421 of SEQ ID NO:l, 
said polynucleotide comprising any one of the positions of said single nucleotide 
polymorphisms and consisting of at least 10 contiguous nucleotides of SEQ ID NO:l, 
or a complementary polynucleotide thereto. 
[Claim 10] 

The polynucleotide of claim 9, wherein said single nucleotide polymorphism 
is selected from the group consisting of G34A, C376T, C421A and single nucleotide 
polymorphisms complementary thereto. 
[Claim 11] 

A polynucleotide having one or more of the nucleotide polymorphisms in the 
polynucleotide sequence of SEQ ID NO:l, said polymorphism selected from the group 
consisting of nucleotide polymorphisms by which the translated amino acid at position 
12 is methionine, one at position 126 is stop codon, and one at position 141 is lysine, 
and consisting of at least 10 contiguous nucleotides including one or more of 
nucleotides located at the site of said nucleotide polymorphisms, or a complementary 
polynucleotide thereto. 
[Claim 12] 

A pair of PCR primers which specifically hybridize to ABCG2 gene, and 
amplify a DNA fragment of a portion of said gene, wherein the amplified DNA 
fragment comprises a nucleotide(s) at position 34, 376 or 421 of SEQ ID NO:l 
[Claim 13] 

The pair of PCR primers of claim 12, said pair of PCR primers selected from 
the group consisting of SEQ ID NO:5 and 6; SEQ ID NO:9 and 10; and SEQ ID 
NO: 11 and 12. 
[Claim 14] 

A polynucleotide which specifically hybridizes to ABCG2 gene, and which is 
capable of detecting a polymorphism(s) of ABCG2 gene at position 34, 376 or 421 of 
SEQ ED NO: 1. 
[Claim 15] 
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The polynucleotide of claim 14, which is capable of using in any one of 
methods selected from the group consisting of a direct sequencing method, TaqMan 
method, invader method, mass spectrometry, RCA method and DNA chip method. 
[Claim 16] 

A polypeptide having polymorphic mutation(s) to ABCG2 protein defined in 
the following (a) or (b), said polypeptide is a polymorphic mutant wherein one or both 
of amino acids at positions 12 and 141 of SEQ ED NO:2 are substituted with other 
amino acid(s), a polypeptide fragment comprising said substituted amino acid and at 
least 10 contiguous amino acid residues of said polymorphic mutant, or a polypeptide 
wherein the amino acid sequence downstream from the position 126 of SEQ ID NO:2 
is deleted: 

(a) a human ABCG2 polypeptide consisting of an amino acid sequence of 
SEQ ID No: 2, 

(b) an isopolypeptide of (a) consisting of an amino acid sequence of SEQ ID 
NO:2, wherein one or several amino acids except for the amino acids at 
positions 12, 126 and 141, are deleted, substituted or added, and having a 
drug transport capability. 

[Claim 17] 

An antibody which specifically binds to the mutant ABCG2 polypeptide of 
claim 16. 
[Claim 18] 

A transformed cell which expresses an polypeptide having polymorphic 
mutation(s) to ABCG2 protein defined in the following (a) or (b), said polymorphic 
mutation(s) being one or both of amino acid substitutions Vall2Met and Glnl41Lys of 
the amino acid sequence of SEQ ID NO:2: 

(a) a human ABCG2 polypeptide consisting of an amino acid sequence of 
SEQIDNO:2, 

(b) an isopolypeptide of (a) consisting of an amino acid sequence of SEQ ID 
NO: 2, wherein one or several amino acids except for the amino acids at 
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positions 12, 126 and 141, are deleted, substituted or added, and having a 
drug transport capability. 
[Claim 19] 

A method for measuring a drug transport capability using the transformed cell 
of claim 18. 
[Claim 20] 

A method for diagnosing a drug sensitivity comprising the steps of: 
analyzing a biological sample from a subject, and determining the presence or absence 
of a polynucleotide of any one of claims 9 to 11, or a polypeptide of claim 16. 
[Claim 21] 

The method of claim 20, wherein the subject having said polynucleotide 

and/or said polypeptide is suggested to be sensitive to the compound represented by 

the following general formula (I): 

R 




[in the formula, X 1 and X 2 each independently represent a hydrogen atom, 
halogen atom or hydroxyl group, R represents a lower alkylamino group (wherein said 
lower alkylamino group may be substituted with a hydrogen atom, amino group, 
formylamino group, one to three hydroxyl group(s), a pyridyl group optionally having 
substituent(s), or thienyl group optionally having substituent(s)), and G represents a 
pentose group or hexose group (said pentose group or hexose group may be substituted 
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with an amino group).] 
[Claim 22] 

A kit for diagnosing a drug sensitivity comprising one or more of the 
following (a) to (f): 

(a) the polynucleotide of any one of claims 9 to 1 1, 

(b) the pair of primers of claim 12 or 13, 

(c) the polynucleotide of claim 14 or 15, 

(d) the polypeptide of claim 16, 

(e) the antibody of claim 17, and 

(f) the transformed cell of claim 18. 
[Claim 23] 

A computer system for analyzing data of ABCG2 polymorphism, comprising: 

(a) an input/output device, 

(b) a memory containing the polymorphism data, and 

(c) a central processing unit. 
[Detailed Description of the Invention] 
[0001] 

[Field of the Invention] 

The present invention relates to a polypeptide which excretes drugs such as 
cancer chemotherapeutic agents from a cell and to a gene coding therefor. More 
specifically, the present invention relates to a method for predicting a drug transport 
capability of a mammalian cell by determining a single nucleotide polymorphism(s) of 
ABCG2 gene and/or an amino acid polymorphism(s) of ABCG2 polypeptide and also 
to a polynucleotide, polypeptide, kit, and the like used for the method. 
[0002] 
[Prior Art] 

Prediction of sensitivity to cancer chemotherapeutic drugs has been a subject 
in conventional cancer therapy by the cancer chemotherapeutic drugs. Anti-tumor 
activity of a chemotherapeutic drug shows a great difference depending on the type of 
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cancer cells and physical trait of each patient. A chemotherapeutic drug is highly 
effective for some patients while, a resistance to the drug is observed for other patients. 
In addition, although tumors are sensitive to chemotherapeutic drugs in early stages, 
they exhibit multidrug resistance afterward. In the conventional methods however, it 
is very difficult to judge whether a chemotherapeutic drug is effective to a specific 
patient. 
[0003] 

As a major cause for the difference of sensitivity to chemotherapeutic drugs, 
there is a difference in drug concentrations in cells due to the difference in drug 
excreting capability. In those cancer cells, each of the transporters which excrete the 
chemotherapeutic drugs out of the cell is a member of ABC transporter superfamily 
(ATP-binding cassette transporter superfamily) and is a group of molecules which is 
localized in cell membrane and transports the substrate utilizing an energy source such 
as ATP hydrolysis. 
[0004] 

As representative examples of the transporter, there have been reported 
P-glycoprotein (hereinafter, referred to as "P-gp") encoded on MDR1 gene and 
multidrug resistance-related proteins (hereinafter, referred to as "MRP") encoded on 
MRP subfamily genes such as MRP1, MRP2 and MRP3. P-gp is a molecular pump 
which was already known to be involved in multidrug resistance in multiple types 
tumor, while MRP is a transporter which was firstly found to be involved in multidrug 
resistance in lung cancer and, later, found to be expressed in other types of cancer as 
well (Cole, S. P. C. et al., Science, 258, 1650-1654 (1992) and Leslie, E. M. et aL, 
Toxicology, 167, 3-23 (2001)). 
[0005] 

In recent years, new ABC family molecules have been found in succession 
and, besides P-gp and MRP, molecular pumps that are suggested to be involved in drug 
resistance are being clarified. As one of such molecules, there is a molecular pump 
called ABCG2 (BCRP/MXR/ABCP). With regard to this, there have been named and 
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reported ABCP as the gene which is expressed specifically in placenta (Allikmets, R. et 
al., Cancer Res. 58, 5337-5339 (1998)), BCRP as the gene obtained from a resistant 
cell line selected by adriamycin (Doyle, A. et al., Proc. Natl. Acad. Sci. U. S. A. 95, 
15665-15670 (1998)) and MXR as the gene obtained from a resistant cell line selected 
by mitoxantrone (Miyake, K. et al., Cancer Res. 59, 8-13 (1999)). Among these three 
kinds of genes, mutations of 1 to 4 amino acid(s) derived from the nucleotide 
substitution between the respective genes were observed. 
[0006] 

From the analysis of the cell line which is produced by introducing and 
expressing the nucleotide sequence reported as BCRP into MCF-7 cell, expression of 
this gene was shown to give resistance to mitoxantrone and adriamycin. Thus, the gene 
has been notable for a novel factor of multidrug resistance (Doyle, A. et al., Proc. Natl. 
Acad. Sci. U. S. A., 95, 15665-15670 (1998) (WO 99/40110). 
[0007] 

Under such circumstances, the present applicant found that the excretion 
pump of indolocarbazole compounds is an ABCG2 gene of SEQ ID NO:l (Komatani, 
H. et al., Cancer Research, 61, 2827-2832 (2001), WO 02/28894). In the gene 
reported as BCRP, the 482nd codon encodes threonine, while the ABCG2 gene of SEQ 
ID NO:l was a new nucleotide sequence where the 482nd codon encodes arginine. 
[0008] 

The ABCG2 gene of SEQ ED NO:l is a gene which confers a selective 
resistance on a cell to a compound of the following general formula (I) (hereinafter, 
referred to as "indolocarbazole compound"): 
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R 




[in the formula, XI and X2 each independently represents a hydrogen 
atom, halogen atom or hydroxyl group; R represents a lower alkylamino group 
(wherein said lower alkylamino group may be substituted with a hydrogen atom, 
amino group, formylamino group, one to three hydroxyl group(s), a pyridyl group 
optionally having substituent(s), or thienyl group optionally having substituent(s)); and 
G represents a pentose group or hexose group (said pentose group or hexose group 
may be substituted with an amino group).], more specifically, to the compound such as 
Compound A (wherein XI is 1 -hydroxyl group, X2 is 11 -hydroxyl group, R is 
formylamino and G is p-D-glucopyranosyl group in the general formula (I)) and 
Compound B (wherein XI is 2-hydroxyl group, X2 is 10-hydroxyl group, R is 
(l-hydroxymethyl-2-hydroxyl) ethylamino group and G is p-D-glucopyranosyl group 
in the general formula (I)). 
[0009] 

It has been shown by Northern blotting analysis that the ABCG2 gene of the 
SEQ ED NO:l, for example, is highly expressed in any cells which are resistant to both 
Compound A and Compound B (Yoshinari, T. et al., Cancer Res. 59, 4271-4275 
(1999)) and that the accumulation of indolocarbazole compounds represented by 
Compound A, Compound B etc. into the cells is selectively suppressed by the gene. 
(Komatani, H. et al., Cancer Res. 61, 2827-2832 (2001); WO 02/28894). Accordingly, 
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analysis of the genetic polymorphisms affecting the activity or expression of ABCG2 
comprising the ABCG2 gene of SEQ ID NO: 1 is thought to be useful for the selection 
of anticancer drug used for the therapy. However, such a genetic polymorphism has 
not yet been known. 
[0010] 

[Problems to be solved by the Invention] 

Under such circumstances, there has been a demand for the development of 
methods for diagnosis of excreting capability of a transporter gene product which 
excretes chemotherapeutic drugs out of the cells in each patient. For example, cancer 
chemotherapeutic drugs having an anthraquinone skeleton such as adriamycin, 
doxorubicin and mitoxantrone are not well effective to cells when the P-gp, MRP or 
BCRP is detected to be highly expressed therein. 
[0011] 

Although the indolocarbazole compounds are effective anti-cancer drugs 
regardless of the expression of the P-gp or MRP, their effect to cancer cells where 
ABCG2 is highly expressed is low. 
[0012] 

However, if the genetic polymorphism affecting the activity or the expression 
of ABCG2 can be previously detected, the detection may be useful for the selection of 
anti-cancer drugs in cancer therapy and for the selection of inhibitors of ABCG2 
activity in combined cancer therapy. 
[0013] 

For example, the ABCG2 gene of SEQ ID NO:l which is widely found is a 
gene giving an indolocarbazole compound-selective resistance on a cell while the 
ABCG2-Thr482 gene where the 482nd amino acid is modified to threonine gives a 
resistance to mitoxantrone and adriamycin in addition to indolocarbazole compounds 
and, therefore, a method for detecting the difference between those two genes is useful 
for the selection of anti-cancer drugs in cancer therapy. 
[0014] 
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In addition, a detection of ABCG2 genetic polymorphism which lowers the 
activity of ABCG2 in advance, for example, is useful for finding the optimum dose of 
the indolocarbazole compound in cancer therapy. 
[0015] 

Accordingly, it is an object of the present invention to provide a 
polymorphism of ABCG2 polypeptide related to intracellular accumulation of 
indolocarbazole compounds and of a polynucleotide coding therefor. It is also an 
object of the present invention to provide a method for detecting the presence or 
absence of the polymorphism of ABCG2 polypeptide or polynucleotide coding therefor 
in the test sample derived from patients suffering from cancer, by using a nucleic acid 
which is specific to polymorphism of AfiCG2-related gene or antibody to ABCG2 
polypeptide. It is a still another object of the present invention to provide a method 
for an effective use of indolocarbazole compounds by detecting the presence or 
absence of the polymorphism of ABCG2 polypeptide or polynucleotide coding 
therefor. 
[0016] 

[Means to solve the Problems] 

In order to solve the objects, the present inventors analyzed genomic DNA 
extracted from many human cancer cell lines and clinical samples and identified single 
nucleotide polymorphisms (SNPs) in the ABCG2 gene. It was found that those SNPs 
cause mutations such as an amino acid substitution and deletion at the specific sites of 
the ABCG2 polypeptide. Then, when cell lines expressing each of the specific 
mutant ABCG2 polypeptides were prepared and their resistance to drugs was tested, it 
was found that a drug transport capability of the mutant ABCG2 polypeptide greatly 
lowered as compared with that of wild type ABCG2 polypeptide. On the basis of 
such findings, the present invention has been accomplished. 
[0017] 

Accordingly, in a first aspect of the present invention, there is provided a 
method for predicting a drug transport capability of a mammalian cell comprising the 
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steps of collecting a sample from a mammal and determining at least a polymorphism 
of the nucleic acid sequence of ABCG2 gene or at least a polymorphism of the amino 
acid sequence of ABCG2 polypeptide. 
[0018] 

In a preferred embodiment of the present invention, the ABCG2 gene 
comprises a DNA consisting of the nucleotide sequence of SEQ ID NO:l and the 
polymorphism of the nucleotide sequence is one or more of single nucleotide 
polymorphisms at positions selected from the group consisting of 34, 376 and 421 of 
SEQ ID NO:l. It is further preferred that the single nucleotide polymorphism is 
selected from the group consisting of G34A, C376T and C421A. Here, "G34A" 
means that the 34th guanine is substituted with adenine, "C376T" means that the 376th 
cytosine is substituted with thymine and "C421A" means that the 421st cytosine is 
substituted with adenine. Polymorphism of the nucleotide sequence can be 
determined by any one of methods selected from the group consisting of a direct 
sequencing method, TaqMan method, invader method, mass spectrometric method, 
RCA method and DNA chip method. 
[0019] 

In another preferred embodiment of the present invention, the ABCG2 
polypeptide comprises a polypeptide consisting of an amino acid sequence of SEQ ID 
NO: 2 and the polymorphism of the amino acid sequence is one or more of amino acid 
polymorphisms at positions selected from the group consisting of the 12, 126 and 141 
of SEQ ID NO:2. It is preferred that the amino acid polymorphism is an amino acid 
substitution of Vall2Met or Ginl41Lys or deletion of the amino acid sequence 
downstream from the position 126 of SEQ ID NO:2. The polymorphism of the amino 
acid sequence can be determined by any of methods selected from the group consisting 
of mass spectrometric method, two-dimensional electrophoresis method and protein 
chip method. 
[0020] 

In a still preferred embodiment of the present invention, the aforementioned 
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drug is a compound represented by the following general formula (I) (hereinafter, 

referred to as "indolocarbazole compound"). 

[0021] 



R 




G 



(I) 

[In the formula, X 1 and X 2 each independently represent a hydrogen atom, 
halogen atom or hydroxyl group; R represents a lower alkylamino group (wherein said 
lower alkylamino group may be substituted with a hydrogen atom, amino group, 
formylamino group, one to three hydroxyl group(s), a pyridyl group optionally having 
substituent(s), or thienyl group optionally having substituent(s)); and G represents a 
pentose group or hexose group (said pentose group or hexose group may be substituted 
with an amino group).] 
[0022] 

In the second aspect of the present invention, there is provided a 
polynucleotide having a single nucleotide polymorphism(s) at one or more position(s) 
selected from the group consisting of 34, 376 and 421 of SEQ ID NO:l wherein the 
polynucleotide comprises any one of the positions of the single nucleotide 
polymorphisms and consists of at least 10 contiguous nucleotides or a complementary 
polynucleotide thereto. In a preferred embodiment, the aforementioned single 
nucleotide polymorphism is selected from the group consisting of G34A, C376T, 
C421A and single nucleotide polymorphisms complementary thereto. 
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[0023] 

In an embodiment, there is provided a polynucleotide having a nucleotide 
polymorphism(s) in the polynucleotide sequence of SEQ ID NO:l, wherein the 
polymorphism is one or more of nucleotide polymorphism(s) selected from the group 
consisting of nucleotide polymorphisms by which the translated amino acid at position 
12 is methionine, one at position 126 is stop codon and one at position 141 is lysine, 
and comprising at least 10 contiguous nucleotides including one or more of 
nucleotide(s) located at the site of the nucleotide polymorphisms, or complementary 
sequence thereof . 
[0024] 

In the third aspect of the present invention, there is provided a pair of PCR 
primers which specifically hybridize to the ABCG2 gene and amplify a DNA fragment 
of a portion of the gene, wherein the amplified DNA fragment comprises a nucleotide 
at position 34, 376 or 421 of SEQ ID NO:l. In a preferred embodiment, the pair of 
PCR primers are any of the primer pairs selected from the group consisting of SEQ ED 
Nos. 5 and 6, SEQ ID Nos. 9 and 10 and SEQ ID Nos. 11 and 12. 
[0025] 

In the fourth aspect of the present invention, there is provided a 
polynucleotide which specifically hybridizes to ABCG2 gene and is capable of 
detecting the polymorphism of ABCG2 gene at position 34, 376 or 421 of SEQ ID 
NO:l. In a preferred embodiment, the aforementioned polynucleotide can be used in 
any of the methods selected from the group consisting of a direct sequencing method, 
TaqMan method, invader method, mass spectrometric method, RCA method and DNA 
chip method. 
[0026] 

In the fifth aspect of the present invention, there is provided a mutant ABCG2 
polypeptide having polymorphic mutation(s) to either (a) a human ABCG2 polypeptide 
consisting of the amino acid sequence of SEQ ID NO: 2 or (b) an isopoiypeptide of (a) 
consisting of an amino acid sequence of SEQ ID NO:2 wherein one or several amino 
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acid(s) except for the amino acids at positionl2, 126 and 141 are deleted, substituted or 
added and having a drug transport capability. And the said mutant ABCG2 polypeptide 
is a polypeptide where one or both of the amino acid(s) at positions 12 and 141 of SEQ 
ID NO:2 are substituted with other amino acid(s), or it is a polypeptide fragment 
comprising the substituted amino acid(s) and at least 10 contiguous amino acid 
residues of the mutant ABCG2 polypeptide above, or it is a polypeptide where the 
amino acid residues downstream from the position 126 of SEQ ID NO:2 are deleted. 
[0027] 

In the sixth aspect of the present invention, there is provided an antibody 
which specifically binds to the mutant ABCG2 polypeptide in the fifth aspect of the 
present invention. 
[0028] 

In the seventh aspect of the present invention, there is provided a transformed 
cell which expresses an ABCG2 polypeptide having one or both of amino acid 
substitutions Vall2Met and Glnl41Lys to the amino acid sequence of SEQ ID NO:2 
defined as either (a) a human ABCG2 polypeptide consisting of an amino acid 
sequence of SEQ ID NO:2 or (b) an isopolypeptide of (a) consisting of an amino acid 
sequence of SEQ ID NO: 2, wherein one or several amino acid(s) except for the amino 
acids at positions 12, 126 and 141, are deleted, substituted or added, and having a drug 
transport capability. 
[0029] 

In the eighth aspect of the present invention, there is provided a method for 
measuring a drug transport capability using the transformed cell in the seventh aspect. 
[0030] 

In the ninth aspect of the present invention, there is provided a method for 
diagnosing a drug sensitivity comprising the steps of collecting a sample from a 
subject and determining the presence or absence of the polynucleotide in the second 
aspect or the polypeptide in the fifth aspect. In a preferred embodiment, it is 
suggested that the subject having the polynucleotide and/or polypeptide is sensitive to 
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the indolocarbazole compound. 
[0031] 

In the tenth aspect of the present invention, there is provided a kit for the 
diagnosing a drug sensitivity comprising one or more of the polynucleotide in the 
second aspect, the pair of primers in the third aspect, the polynucleotide in the fourth 
aspect, the polypeptide in the fifth aspect, the antibody in the sixth aspect and the 
transformed cell in the seventh aspect. 
[0032] 

In the eleventh aspect of the present invention, there is provided a computer 
system for the analysis of ABCG2 polymorphism comprising (a) an input-output 
device(s), (b) a memory (storage medium) containing the polymorphism data and (c) a 
central processing unit. 
[0033] 

[Preferred Embodiments of the Invention] 
(Definitions) 

In the present specification, the following terms are defined as follows unless 
otherwise mentioned. "ABCG2" is a molecular pump which belongs to the ABC 
transporter superfamily and is a name of a polypeptide by which cancer 
chemotherapeutic drugs are excreted out of a cell or a gene coding therefor. The gene 
includes cDNA and genomic gene. 
[0034] 

The term "polymorphism" refers to the existence of more than one form of a 
gene, polypeptide or portion thereof. A portion of a gene, wherein there are at least two 
different forms, i.e., two different nucleotide sequences, is referred to as a 
"polymorphic region of a gene". A polymorphic region can be a single nucleotide pair, 
or can also be nucleotide pairs in some measure of length. The term "single 
nucleotide polymorphism (SNP)" refers to the replacement of one base by another base 
and, in human genome, it is presumed that an SNP is present in several hundreds to 
one thousand base pairs. Besides that, there exists some repetitive sequences having 
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a different repeated number among individuals in the site where a unit of two bases to 
several tens bases is repeatedly present, which are called VNTR (variable number of 
tandem repeats) and a microsatellite polymorphism. The SNPs have different 
functions depending upon the position where they are located, and some exist in the 
region which is translated to polypeptide resulting in substitution or deletion of amino 
acid sequence and affecting on the function of gene, some others exist in the region 
which controls the gene expression such as promoter or intron affecting on the 
expressed amount of gene, still some others exist in other region having nearly no 
influence on the gene expression. 
[0035] 

In the present specification, the term "mammalian cell" means tissues or cells 
that constitute animal body belonging to mammal or external cell cultures of such cells. 
The term "sample" means a sample containing polynucleotides derived from living 
organisms and includes living, dead or even archaeological sample collected from 
various tissues and cells. Specific examples are body fluid (blood, urine, saliva, and 
the like), skin, root of hair, mucous membrane, internal organs, placenta and cord 
blood. 
[0036] 

In the present specification, the term "drug" means a xenobiotic having a 
physiological activity including a cancer chemotherapeutic drug used for the purpose 
of treating cancer. It includes a synthetic compound, natural compound derived from 
plants or microorganisms and a semi-synthetic compound which is synthesized from 
the natural compound. Preferably, the "drug" means a compound represented by the 
following general formula (I) (hereinafter, referred to as "indolocarbazole compound"): 
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R 



X 




N 
H 



2 



G 



(I) 



[In the formula, X 1 and X 2 each independently represent a hydrogen atom, 
halogen atom or hydroxyl group; R represents a lower alkylamino group (wherein said 
lower alkylamino group may be substituted with a hydrogen atom, amino group, 
formylamino group, one to three hydroxyl group(s), a pyridyl group optionally having 
substituent(s), or thienyl group optionally having substituent(s)); and G represents a 
pentose group or hexose group (said pentose group or hexose group may be substituted 
with an amino group).] More preferably, it means a compound of the general formula 
(I) [In the formula, X 1 and X 2 each independently represent a halogen atom or 
hydroxyl group; R represents a hydrogen atom, formylamino, or lower alkylamino 
wherein said lower alkylamino may be substituted with any one selected from the 
group consisting of one to three hydroxyl group(s), a pyridyl group optionally having 
substituent(s), and a thienyl group optionally having substituent(s); and G represents a 
hexose group which may be substituted with an amino group.] The production 
method and the like of the aforementioned indolocarbazole compounds have been 
disclosed in prior patent applications and registered patents (European patent 
publication 0528030 Al, U. S. Patent Nos. 5,591,842, 5,668,271, and 5,804,564, WO 
95/30682, WO 96/04293, WO 98/07433 and JP Patent Kokai Publication No. 
JP-A-10-245390). Particularly with regard to the production methods of Compound 
A and Compound B, they are disclosed in JP Patent Kokai Publication No 
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JP-A-6- 128283 and WO 95/30682, respectively. 
[0037] 

In the present specification, the term "polynucleotide" generally refers to both 
polyribonucleotide and polydeoxyribonucleotide, which can be either a non-modified 
RNA or DNA and either a modified RNA or DNA. The examples thereof are DNA, 
cDNA, genomic DNA, mRNA, unprocessed RNA and fragments thereof. Although 
there is no particular limitation in its length, it is usually about 10 bases or longer. On 
the other hand, the term "oligonucleotide" refers to those which are relatively shorter 
than the "polynucleotide" in length, which is generally about 50 bases or less. 
[0038] 

In the present specification, the term "polypeptide" refers to a compound 
which is linked with a peptide bond(s) consisting of two or more amino acids and 
includes a relatively short-chain polypeptide called as a peptide or oligopeptide and a 
long-chain polypeptide called as a protein. The polypeptide may contain amino 
acid(s) which is other than the genetically coded 20 kinds of amino acids. It is also 
possible to contain modified amino acid. Such a modified amino acid(s) is produced 
in vivo, for example, by a posttranslational processing or by a chemical modification 
which is known among the persons skilled in the art. The modification can be take 
place at main chain of peptide bond, side chain of amino acid, amino terminal or 
carboxyl terminal and includes, for example, acetylation, acylation, ADP ribosylation, 
amidation, biotinylation, covalent bond with lipid or lipid derivative, formation of 
cross-linking bond, disulfide bond, addition of sugar chain, addition of GPI anchor, 
phosphorylation and prenylation. 
[0039] 

(Method for Predicting the Drug Transport Capability) 

In an embodiment of the present invention, there is provided a method for 
predicting a drug transport capability of a mammalian cell, comprising the steps of 
collecting a sample from the mammal and determining at least a polymorphism of the 
nucleotide sequence of ABCG2 gene or at least a polymorphism of the amino acid 
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sequence of ABCG2 polypeptide. Here, the ABCG2 gene comprises a human cDNA 
having a nucleotide sequence shown in SEQ ID NO:l which is a gene giving a 
resistance on a cell to the indolocarbazole compound represented by Compound A. It 
further comprises a human isogene which hybridizes under a stringent condition to a 
DNA complementary to the nucleotide sequence of SEQ ID NO:l, and also encodes a 
polypeptide having a drug transport capability, as well as mammalian homologues 
thereto. The condition of "to hybridize under a stringent condition" is an 
experimental condition for hybridization which has been known among the persons 
skilled in the art. To be more specific, it means two nucleic acid fragments hybridize 
each other under a hybridization condition described in "Expression of cloned genes in 
E. coir by J. Sambrook in 9.47 - 9.62 and 11.45 - 11.61 of "Molecular Cloning: A 
Laboratory Manual: 2nd edition (1989), Cold Spring Harbor Laboratory Press, New 
York, U.S. A.". 
[0040] 

To be more specific, "under a stringent condition" means that, after 
hybridization at about 45°C in 6.0 x SSC, washing is conducted at 50°C in 2.0 x 
SSC. For the selection of stringency, salt concentration in the washing step may be 
selected, for example, from a low stringency of about 2.0 x SSC at 50°C to a high 
stringency of about 0.2 x SSC at 50°C. It is also possible that temperature for the 
washing step may be increased from a low stringency condition of room temperature 
or about 22°C to a high stringency condition of about 65°C. Incidentally, it is 
possible for persons skilled in the art to achieve a hybridization condition of the same 
stringency as the above condition by an appropriate selection of various conditions 
such as diluting ratio of SSC, concentration of formamide and temperature. 
Accordingly, the isogene includes various mutant genes which have been known 
already. For example, BCRP gene obtained from adriamycin-resistant cell line, 
ABCP gene which is specifically expressed in placenta and MXR gene obtained from 
resistant cell line selected by mitoxantrone are different from human ABCG2 gene of 
SEQ ID NO:l in several nucleotide sequences but all of them are isogenes derived 
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from ABCG2 gene of SEQ ID NO:l and are included in "ABCG2 gene" of the present 

invention. 

[0041] 

ABCG2 polypeptide is: (a) a human ABCG2 polypeptide consisting of an 
amino acid sequence of SEQ ID NO:2; (b) an isopolypeptide to (a) consisting of an 
amino acid sequence of SEQ ID NO: 2 wherein one or several amino acids except for 
the amino acid at positions 12, 126 and 141 are deleted, substituted or added and 
having a drug transport capability; or (c) a mammalian homologue to (a) or to (b). 
Here, the human ABCG2 polypeptide consisting of the amino acid sequence of SEQ 
ID NO:2 is a polypeptide which gives a selective resistance on a cell to the 
indolocarbazole compound represented by Compound A. The isopolypeptide may 
have modifications of deletion, substitution or addition of one or several amino acid(s) 
in the amino acid sequence of SEQ ID NO:2 so far as a drug transport capability which 
is a function of the ABCG2 polypeptide is remained, and numbers of modified amino 
acid in the functionally identical polypeptide are usually within 10% of total amino 
acids, preferably within 10 amino acids and, more preferably, modification numbers 
are within 3 amino acids (such as one amino acid). 
[0042] 

When the ABCG2 genetic polymorphism and ABCG2 polypeptide 
polymorphism are present in specific positions and those polymorphisms are present in 
more than a certain frequency in a specific population, the genetic significance of those 
polymorphisms becomes important. In a preferred embodiment of the present 
invention, specific' SNPs as shown in Fig. 1 are disclosed. Fig. 1 shows schematically 
how the ABCG2 polypeptide is present in ceil membrane along with the SNP sites 
according to the present invention. The ABCG2 polypeptide contains a leader 
sequence at its N-terminal necessary for localization to the cell membrane, followed by 
an ATP binding region (amino acids 6T270) and six transmembrane regions 
participating in the drug transportation. Fig. 1 shows four SNP sites which are 
mutations where the 34th guanine in SEQ ID No. 1 is substituted with adenine 
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(hereinafter, referred to as "G34A"), the 376th cytosine therein is substituted with 
thymine (hereinafter, referred to as "C376T"), the 421st cytosine therein is substituted 
with adenine (hereinafter, referred to as "C421A") and the 458th cytosine therein is 
substituted with thymine (hereinafter, referred to as "C458T"). As a result of those 
SNPs, in the amino acid sequence of the ABCG2 polypeptide, the 12th valine from the 
N-terminal is substituted with methionine (hereinafter, referred to as "Vall2Met"), the 
126th glutamine therefrom becomes a termination codon (hereinafter, referred to as 
"Glnl26Term), the 141st glutamine therefrom is substituted with lysine (hereinafter, 
referred to as "Glnl41Lys") and the 153rd threonine therefrom is substituted with 
methionine (hereinafter, referred to as "Thrl53Met"). The mutation of Vall2Met is 
present in a leader sequence necessary for localization of the ABCG2 polypeptide to 
cell membrane and the mutations of Glnl41Lys and Thrl53Met are present in an ABC 
(ATP-binding cassette) domain which is important for binding to ATP as transportation 
energy and, therefore, there is a strong possibility that those mutations affect on the 
drug transport capability of the ABCG2 polypeptide. It is apparent that a mutation of 
Glnl26Term loses a drug transport activity since a complete ABCG2 polypeptide is not 
synthesized. 
[0043] 

Drug transport capability of those mutant ABCG2 polypeptides can be 
checked by preparing a transformant which expresses the mutant ABCG2 polypeptide 
by means of a recombinant DNA technique. As will be illustrated in detail 
hereinafter in the present specification, it is noted that, as a result of measurement of 
drug sensitivity using those transformed cells, the drug transport activity of the mutant 
ABCG2 polypeptides of the aforementioned Vall2Met and Glnl41Lys is significantly 
low as compared with that of the wild type ABCG2. 
[0044] 

Alternatively, it is also possible to test whether those mutations are related to 
the drug sensitivity or not by analyzing biological samples obtained from a group of 
the subjects having a high sensitivity to a specific drug and those obtained from a 
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normal group and then analyzing the statistical relationship with the polymorphism of 
the present invention (so-called case-control study). The statistical analysis can be 
earned out using a program, etc. being known among the persons skilled in the art. 
[0045] 

With regard to a method for determining the polymorphism of the 
aforementioned nucleic acid sequence, that may be carried out using various known 
arts which will be mentioned below for (1) determination of nucleotide sequence of a 
part of allele containing at least a polymorphic site, (2) detection by a probe 
(allele-specific probe) which specifically hybridizes to a polymorphic site, (3) 
measurement of molecular weight of gene fragment containing a polymorphic site, etc. 
For example, SNP can be directly detected from genomic DNA by a direct sequencing 
method. On the other hand, it is also possible to use the aforementioned means for 
identification of (1)~ (3) after a specific genomic DNA region is amplified. Various 
methods for the DNA amplification are known to those skilled in the art and include, 
but are not limited to, cloning of a desired DNA fragment, polymerase chain reaction 
(PCR), iigase chain reaction (LCR), strand displacement amplification (SDA; Walker 
G, et al. Proc. Natl. Acad. Sci. USA, 89, 392-396 (1992)), transcription-based 
amplification (Kwoh, D. et al., Proc. Natl. Acad. Sci. USA, 86, 1173-1177 (1989)), 
self-sustained sequence replication (Guatelli, J., et al., Proc. Natl. Acad. Set USA, 87, 
1874-1878 (1990)), the Q-p repiicase system (Lizardi, P. et al., Bio/Technology, 6, 
1197-1202 (1988)), nucleic acid sequence-based amplification (NASBA; Lewis, R., 
Genetic Engineering News, 12, 1 (1992)), the repair chain reaction (RCR), LAMP 
method (WO 00/28082), and the like. 
[0046] 

SNP of the amplified product can be determined by various methods, such as 
determination of the nucleotide sequence, measurement of molecular weight by 
MALDI-TOF mass spectrometry, etc. and analysis of restriction fragment length 
polymorphism (RFLP). The single strand conformation polymorphism (SSCP) 
detection technique is also such another method for separation based on an acrylamide 
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gel, etc., but non-denaturing conditions. It is also possible to carry out by a suitable 
capillary electrophoresis. This technique makes it possible to discriminate between 
different DNA fragments by their conformation (Orita, et aL, Proc. Natl. Acad. Sci. 
USA, 86, (1989), Cotton Mutat. Res., 285, 125-144 (1993), Hayashi, Genet. Anal. Tech. 
Appl. 9, 73-79 (1992)). 
[0047] 

TaqMan (trademark) method is a method for the detection of SNP using a 
fluorescent energy transfer phenomenon where hybridization of an alleie-specific 
oligonucleotide to a template is performed simultaneously with PCR (cf. Livak, ef al. t 
PCR Methods and Application, 4:357-362, 1995 and U. S. Patent No. 5,528,848). 
The alleie-specific probe which is labeled with a fluorescent dye and a quencher is 
hybridized to a target site and PCR is performed using a primer which is designed to 
amplify the region including the site whereupon the hybridized probe is cleaved by 
5' -nuclease activity of Taq polymerase as the elongation reaction from the primer 
proceeds. When the fluorescent dye is separated from the quencher, fluorescence is 
resulted and the template is amplified by the PCR whereupon fluorescent intensity is 
potentiated exponentially. When probes which are specific to two kinds of alleles are 
labeled with different fluorescent dyes, it is possible to discriminate homozygote from 
heterozygote by one assay. 
[0048] 

A variety of methods without amplification of DNA have been developed. 
For example, Invader method (trademark) is based on a special enzymatic reaction 
where two kinds of oligonucleotides (invader probe and allele probe) are used and a 
specific structure formed by those probes with a template DNA is recognized and 
cleaved and it is described, for example, in U. S. Patent Nos. 5,846,717, 5,614,402, 
5,719,028, 5,541,311 and 5,843,669. In this method, the target nucleotide sequence is 
recognized by two different probes. The first probe is usually called an invader probe 
and is substantially complementary to the first site of the target nucleotide sequence. 
The second probe is called allele probe and its 3' -terminal side is substantially 
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complementary to the second site of the target nucleotide sequence while its 
5'-terminal side contains a sequence called a tail or flap which is non-complementary 
to the template to form a single strand. When those probes hybridize to an adjacent 
region of the template, the 3' -terminal of the invader probe invades into an SNP site 
and this structure is cleaved by cleavase whereupon a flap is liberated. The liberated 
flap can be quantified when it is previously labeled. Preferably, in order to quantify 
the liberated flap, the third FRET (fluorescence resonance energy transfer) probe 
(including a sequence complementary to the flap and a self-complementary sequence) 
labeled with a fluorescent dye and a quencher may be used. The liberated flap forms 
a specific structure by binding to the FRET probe, and the part of fluorescent dye in the 
FRET probe is cleaved by cleavase whereupon fluorescence is generated. When two 
sets of flap-FRET probes are prepared and labeled with different fluorescent dyes, it is 
possible to discriminate each homozygote and heterozygote by one assay. 
[0049] 

MALDI-TOF mass spectrometry is a method which can process a large 
number of samples in a short time without fluorescence labeling of the primer. A 
primer adjacent to the SNP site is prepared and one base elongation reaction from the 
primer is performed using ddNTP and PCR-amplified sample DNA as a template. 
The ddNTP added to the elongation reaction product is discriminated by mass 
spectrometry. 
[0050] 

RCA (rolling circle amplification) is a method where a DNA amplifying 
means, in which a long complementary-stranded DNA is synthesized as DNA 
polymerase moves on a cyclic single-stranded template DNA, is applied to an SNP 
typing. Recognition of SNP (allele) is carried out by checking whether amplification 
is available by an RCA method. Namely, a single-stranded probe (padlock probe), 
which is able to give a ring form when annealed with genomic DNA, is hybridized to 
genomic DNA to conduct a chain reaction. When the terminal of the probe is made 
as a site of SNP, a connected ring is formed whereupon amplification by RCA takes 

25 



JP-2002-175806 



filed on June 17, 2002 



place if that site is matched while, when mismatched, no connection takes place giving 
no ring whereupon RCA amplification does not take place. Discriminating the two 
amplification reactions make it possible to determine SNP (Lizardi, P. M., et al., Nat. 
Genet., 19, 225 (1998)). 
[0051] 

DNA chip method where PCR-amplified fluorescence-labeled cDNA or cRNA 
is hybridized to various oligonucleotide probes including polymorphism site using 
DNA chip arranged on a microarray is useful as a means for quick detection of many 
SNPs. There have been known a thing where oligonucleotide is synthesized on an 
array by an optical lithographic technique so that several thousands to several hundred 
thousands probes are arranged on a chip (manufactured by Affymetrix; cf. U. S. Patent 
Nos. 5,424,186, 5,744,101 and 6,040,138), and a method where a previously-prepared 
cDNA or oligonucleotide is fixed on glass by means of pin or ink jet system (cf. U. S. 
Patent No. 6,040,138). 
[0052] 

With regard to a method for determining the polymorphism of the 
aforementioned amino acid sequence, various methods have been known and examples 
thereof are a proteome analysis by a two-dimensional electrophoresis or microfluidics 
method (Vreeland, Wyatt N and Barron, Annelise E, Current Opinion in Biotechnology , 
Vol. 13, pages 87-94 (2002)), peptide mapping and amino acid sequence analysis using 
a mass spectrometric devices, amino acid sequence analysis by a protein sequencer and 
a method where interaction between polypeptide and ligand is detected using protein 
chips, etc. 
[0053] 

The two-dimensional electrophoresis is usually a method where isoelectric 
focusing is conducted in the first dimension while SDS-PAGE (sodium dodecyi 
suifate-polyacrylamide gel electrophoresis) is conducted in the second dimension, and 
several thousands of proteins can be separated by one sheet of gel. In the isoelectric 
focusing, a carrier-ampholyte has been used already but, in recent years, immobilized 
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pH-gradient gel; IPG) strips have been put on the market and it is now possible to 
separate with a good reproducibility without causing a pH drift. In the SDS-PAGE, 
there are a continuous buffer system where one type of pH buffer is used and a 
discontinuous buffer system using buffers of plural pH values. It is also possible to 
use a low-BIS concentration gel electrophoresis, a concentration-gradient gel 
electrophoresis, a Tricine-SDS-PAGE, etc. depending upon the type of the protein to be 
separated. The separated protein can be usually quantified by staining with a dye 
such as Coomassie Blue. In a silver staining method, protein can be identified in a 
sensitivity of 20- to 100-fold as compared with the staining with Coomassie Blue. 
Alternatively, detection with a high sensitivity on a gel is possible using commercially 
available fluorescent dyes such as SYPRO Ruby and SYPRO Orange (Patton, W. E, 
Electrophoresis, 21, 1123-1144 (2000)). It is also possible to specifically detect an 
ABCG2 polypeptide by a western blotting method using an antibody to the ABCG2 
polypeptide. 
[0054] 

Mass spectrometry is a technique by which mass (molecular weight) is 
precisely measured and, in recent years, this method makes it possible to measure 
precisely the molecular weights of the nucleic acids and proteins by making practical 
use of ionizing (highly hydrophilic) high-molecular substances having high polarity 
such as protein, etc. without decomposition. As one of such mass spectrometric 
methods, there has been known MALDI-TOF/MS (matrix-assisted laser desorption 
ionization time-of-flight/mass spectrometry). This is a method where, after mixing of 
a protein sample with a matrix which absorbs laser beam such as sinapinic acid 
(3,5-dimethoxy-4-hydroxycinnamic acid) followed by drying, a strong pulse laser is 
irradiated to conduct ionization of the protein sample by energy transfer from the 
matrix and molecular weight of the ion is analyzed by the difference in flying time of 
the molecular ion samples by the initial acceleration. In order to fragment the peptide 
in the inner area of the mass spectrometer and to obtain a structural information (such 
as amino acid sequence or amino acid composition) from analysis of the mass of the 
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fragment, a tandem mass spectrometry (MS/MS) where plural mass separating parts 
are connected is utilized and, for such a purpose, there are also used analyzers of a 
triple quadrupole type or a hybrid type using an electrospray ionization method and of 
an ion trap type, etc. 
[0055] 

Protein chip method is a technique which involves interaction of a sample 
with proteins, peptides, etc. placed on a substrate comprehensively and quickly and, 
with regard to ligands to be immobilized on the substrate, there have been developed 
peptides, antibodies, expressed proteins, etc. 
[0056] 

(Polynucleotide, Pair of primers and Kit) 

In another embodiment of the present invention, there are provided a . 
polynucleotide containing the ABCG2 genetic polymorphism, a pair of primers for 
amplifying the DNA fragment containing the aforementioned polymorphisms, a 
polynucleotide for the detection of the aforementioned polymorphisms and a kit 
therefor. 
[0057] 

In an embodiment, the polynucleotide of this embodiment is a polynucleotide 
having a single nucleotide polymorphism(s) at one or more position(s) selected from 
the group consisting of 34, 376 and 421 of the SEQ ID NO:l wherein the 
polynucleotide comprises any one of the positions of the aforementioned single 
nucleotide polymorphisms and consists of at least ten continuous nucleotides or 
complementary nucleotide thereto. Accordingly, a polynucleotide containing none of 
the single nucleotide polymorphisms at the aforementioned three positions or, in other 
words, a polynucleotide where the nucleotide sequences in the above three places are 
the same as those in SEQ ID NO:l is not included in the present embodiment. 
Preferably, in those nucleotide sequences, the 34th guanine is substituted with adenine, 
the 376th cytosine is substituted with thymine or the 421st cytosine is substituted with 
adenine and, in addition to those, substitution in the nucleotide sequence where codon 
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of the 12th amino acid from N-terminal of the ABCG2 polypeptide is varied to 
methionine, the 141st codon thereof is varied to lysine or the 126th codon thereof is 
varied to termination codon is included as well. 
[0058] 

Those polynucleotides may be natural or synthetic compounds. For example, 
they may be manufactured by replication (duplication) of cDNA or genomic DNA 
within host cells using recombinant DNA techniques. Alternatively, they may be 
manufactured by synthesis in vitro. With regard to the synthetic method, it is possible 
to amplify the DNA by means of PCR or the like or to synthesize the DNA by 
chemical synthesis. For the persons skilled in the art, it is possible to introduce a 
site-specific mutation(s) into the ABCG2 gene of SEQ ID NO:l by a known method to 
prepare the polynucleotide of the present embodiment. Examples of the method for 
introduction of site-specific mutations which is known to the persons skilled in the art 
are Kunkel method (Kunkel, T. A. et al., Methods Enzymol. 154, 367-382 (1987)), 
double primer method (Zoller, M. J and Smith, M., Methods EnzymoL, 154, 329-350 
(1987)), cassette mutation method (Wells, et al., Gene, 34, 315-23 (1985)) and 
mega-primer method (Sarkar, G. and Sommer, S. S., Biotechniques, 8, 404-407 
(1990)). 
[0059] 

Those polynucleotides may be used for the detection of the genetic 
polymorphisms concerning the present invention. They may be also used for the 
suppression of the gene expression as antisense DNAs. 
[0060] 

In another embodiment, there is provided a pair of PCR primers which 
specifically hybridize to the ABCG2 gene and amplify a DNA fragment of a portion of 
the gene wherein the amplified DNA fragment comprises a nucleotide at position 34, 
376 or 421 of SEQ ID No. 1 . The pair of primers of this embodiment are designed so 
as to be substantially complementary to each chain in specific regions of upper stream 
and downstream of the aforementioned polymorphic sites of ABCG2 gene. Although 
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each of those primers can be hybridized at the region which is apart in 25~ 2500 base 
pair, it is preferred that the size of the amplified product is 100" 500 base pair so as to 
make determination of nucleotide sequence or analysis of molecular weight of the 
amplified product easy. More preferably, the size of the amplified product is 80~ 200 
base pair. Although the length of those oligonucleotide primers may be within a 
range of 10~30 bases, there may be used an oligonucleotide primer having preferably 
18~25 bases and, more preferably, an oligonucleotide primer having 20"* 22 bases as 
shown in SEQ ID Nos. 5 and 6, SEQ ID Nos. 9 and 10 and SEQ ID Nos. 11 and 12, 
respectively. Those primers may be labeled for making the detection of the amplified 
DNA fragment easy. With regard to the label, there may be used, for example, 
radioisotope, enzyme, fluorescent dye, streptoavidin, avidin, magnetic beads, antigen 
and antibody. 
[0061] 

In still another embodiment, there is provided a polynucleotide which 
specifically hybridizes to the ABCG2 gene and which is capable of detecting a 
polymorphism of the ABCG2 gene at positions 34, 376 or 421 of SEQ ID NO:l. 
With regard to a method for the detection, there are various methods and, for example, 
in conducting a detection by an invader method, there are provided an invader probe 
being designed to complimentary bind to the 3 '-side of the template from the SNP 
site, and an allele probe containing a complementary sequence to the 5' -side of the 
template from the SNP site and having a sequence (flap) which is unrelated to the 
sequence of the template at the 5 '-side thereof. The 3' -terminal of an invader probe 
which is a sequence of SNP site may be any base. 
[0062] 

A TaqMan probe is a polynucleotide which contains an SNP site and has a 
length of about 20 bases being complementary to a template. Its 5 '-terminal is 
labeled with a fluorescent dye such as FAM or VIC while 3 '-terminal is labeled with a 
quencher (optical quenching substance). 
[0063] 
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In a padlock probe used for RCA method, each of its both ends comprises 
about 20 bases near the SNP on genome and two of them are linked by a specific 
sequence called a backbone. 
[0064] 

Preferably, the aforementioned detection is carried out by determining the 
nucleotide sequence using genomic DNA directly or using amplified DNA fragments. 
A sequence primer for the determination of a nucleotide sequence is designed so as to 
make it substantially complementary to an appropriate site in upper stream or 
downstream of the SNP site. Although the length of those sequence primers may be 
within a range of 10 to 30 bases, those having 18 to 25 bases may be preferably used 
and, more preferably, primers in an ordinary direction (sense primer) or primers in an 
inverted direction (antisense primer) as shown in SEQ ID Nos. 37, 38,-41, 42, 43 and 
44 may be used. Those oligonucleotides may be chemically synthesized by various 
methods which are known among the persons skilled in the art. They may be also 
labeled for making the detection easy. With regard to a method of labeling, there may 
be used radioisotope, enzyme, fluorescent substance, streptoavidin, avidin, biotin, 
magnetic fine particles, antigen and antibody, etc. 
[0065] 

In another embodiment, there is provided a kit for the prediction and detection 
of drug transporting capability of mammals. The kit contains either or both of a pair 
of primers for amplifying the DNA fragments containing the ABCG2 polymorphism 
and a polynucleotide for detecting the polymorphism. It is possible that the target 
DNA is firstly amplified from the sample to be tested and genetic polymorphisms are 
determined using the amplified DNA. On the other hand, it is also possible to 
determine the polymorphism directly from genomic DNA without amplification 
reaction of DNA. With regard to such a method, Invader method may be exemplified. 
As optionally selected attachments, the kit may contain a reagent for extraction and for 
purification of DNA, reagent for PCR such as 10-fold concentrated buffer, 
heat-resistant DNA polymerase, four kinds of nucleotide triphosphates (dNTPs), etc. 
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[0066] 

(Polypeptide) 

In still another embodiment of the present invention, there is provided a 
polypeptide having a polymorphic mutation relevant to the present invention in the 
ABCG2 polypeptide or isopolypeptide thereof. The ABCG2 polypeptide is a 
polypeptide consisting of an amino acid sequence of SEQ ED NO: 2 and the 
isopolypeptide thereof is a polypeptide consisting of an amino acid sequence where 
one or several amino acid(s) is/are deleted, substituted or added in the ABCG2 
polypeptide and having a drug transport capability. Mutant polypeptides which have 
been known already such as BCRP, ABCP and MXR are also included therein. The 
polymorphism in this embodiment is substitution of either or both of the 12th and 
141st amino acids in SEQ ID NO: 2 with other amino acid(s) or deletion of an amino 
acid sequence which is downstream from the 126th of SEQ ID NO:2. The amino acid 
substitution may be anything so far as it is other than an amino acid residue shown in 
SEQ ID NO:2 but, preferably, the 12th and 141st amino acids in SEQ ID NO:2 are 
substituted with methionine and lysine, respectively. 
[0067] 

The present embodiment further includes polypeptide fragment of the ABCG2 
polypeptide or isopolypeptide thereof where either or both of the 12th and 141st amino 
acid(s) of SEQ ID No. 2 is/are substituted with other amino acid(s). The polypeptide 
fragment comprises at least 10 contiguous amino acid residues, preferably 20 or more 
contiguous amino acid residues and, more preferably, it has the length of 30 or more 
amino acid residues. Those polypeptides or fragments thereof are useful for the 
preparation of an antibody to the polypeptide having polymorphic mutations. 
[0068] 

Such polypeptides having polymorphic mutation(s) can be manufactured by 
means of chemical synthesis and they further include natural polypeptides and those 
which are prepared as recombinant polypeptides utilizing genetic recombination 
techniques. The natural polypeptides may, for example, be an extracted and purified 
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polypeptide from tissues such as placenta where the human mutant ABCG2 
polypeptide of this embodiment is thought to be expressed. On the other hand, the 
recombinant polypeptides can be prepared, as will be mentioned later, by cultivation of 
cells transformed by DNA coding for the human mutant ABCG2 polypeptide of this 
embodiment. 
[0069] 

The expressed or isolated polypeptide or the fragment thereof may be detected 
by known methods and it is possible to detect by, for example, Coomassie Blue 
staining, silver staining, western blotting method using an antibody specific to 
polypeptide having a polymorphic mutation, etc. In addition, those polypeptides may 
be purified by the methods which have been known already. Those methods include 
precipitation with ammonium sulfate, gel filtration chromatography, ion-exchange 
chromatography and affinity or immunochromatography. 
[0070] 
(Antibody) 

In an embodiment of the present invention, an antibody which specifically 
binds to the mutant ABCG2 polypeptide having a polymorphism(s) of the present 
invention is provided. The antibody of this embodiment can be prepared according to 
the method known to the skilled person in the art (refer, for example, to "Shin 
Seikagaku Jikken Koza (New Experimental Course of Biochemistry) 1, Protein I, 
pages 389-406, Tokyo Kagaku Dojin"). The preparation of polyclonal antibody is 
performed, for example, as follows. To an immunocompetent animal such as rabbit, 
guinea pig, mouse and chicken is administered the appropriate dose of the mutant 
ABCG2 protein of the present invention or the partial peptide thereof. The 
administration may be accompanied by an adjuvant (FIA or FCA) which promotes the 
antibody production. It is generally administered every several weeks. Multiple 
immunizations can elevate the antibody titer. After the final immunization, antiserum 
is obtained by collecting blood from the immunized animal. The polyclonal antibody 
can be prepared from this antiserum by, for example, fractionation with ammonium 



33 



JP-2002- 175806 



filed on June 17, 2002 



sulfate precipitation and/or anionic exchange chromatography and/or by affinity 
purification using Protein A and/or immobilized antigen. On the other hand, a 
monoclonal antibody is prepared, for example, as follows. The mutant ABCG2 
polypeptide of the present invention or the partial peptide thereof is immunized to an 
immunocompetent animal as described above, and after the final immunization, spleen 
or lymph node is collected from the immunized animal. A hybridoma cell is prepared 
by the cell fusion of the antibody-producing cell which is contained in this spleen or 
lymph node and a myeloma cell using polyethylene glycol or the like. The aimed 
hybridoma is screened and cultivated and a monoclonal antibody can be prepared from 
the culture supernatant. Purification of the monoclonal antibody can be performed, 
for example, by fractionation with ammonium sulfate precipitation and/or anion 
exchange chromatography and/or by affinity purification using Protein A and/or 
immobilized antigen. The antibody thus prepared is used for an affinity purification 
of the mutant ABCG2 polypeptide of the present invention and may be used for 
detecting the amount of the expression of the mutant ABCG2 polypeptide of the 
present invention as well. The detection of the expressed amount of the mutant 
ABCG2 polypeptide of the present invention in a mammalian cell by the antibody 
makes it possible to determine the sensitivity of the mammalian cell to the compound 
represented by the formula (I). It is further possible that the detection of the mutant 
ABCG2 polypeptide of the present invention in a cancer cell or cancer patient by this 
antibody can be used for the pharmacogenomical therapy which determines the 
patient's constitution such as drug sensitivity for administration of optimum drug for 
the patient. 
[0071] 

(Transformed Cells and Method for Measuring the Drug Transport Capability Using 
the Transformed Cells) 

The present invention further relates to a transformed ceil which expresses an 
ABCG2 polypeptide having one or both of amino acid substitution(s) Vall2Met and 
Glnl41Lys of the amino acid sequence shown by SEQ ID NO:2 in: (a) a human 
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ABCG2 polypeptide consisting of an amino acid sequence of SEQ ID NO:2 or (b) an 
isopolypeptide of (a) consisting of an amino acid sequence of SEQ ID NO:2 wherein 
one or several amino acid(s) except for the amino acids at position(s) 12, 126 and 141 
is/are deleted, substituted or added and having a drug transport capability. The term 
"transformed cell" refers to a cell where an exogenous DNA is incorporated into a host 
cell by a recombinant vector. The host cell may be either a prokaryotic cell or an 
eukaryotic cell and includes any cell which can be used for the object of the present 
invention, such as bacterium, yeast cell, insect cell or animal cell. To be more 
specific, it is possible to introduce the recombinant vector into the host cell by the 
following method whereupon the transformant is obtained. Transformation of 
Escherichia coli is carried out by the method of Hanahan (Hanahan, D., J. Mol. BioL 
166, 557-580 (1983)), the electroporation method (Dower, W. J., et al., Nucl. Acid Res. 
16, 6127-6145 (1988)), and the like. Transformation of yeast is carried out, for 
example, by spheroplast method (Beach, D. and Nurse, P., Nature, 290, 140 (1981)), 
lithium acetate method (Okazaki, K., et al., Nucleic Acids Res., 18, 6485-6489 (1990)), 
etc. Transformation of insect cell may be carried out by a method, for example, 
described in Bio/Technology, 6, 47-55 (1980). Introduction of recombinant DNA into 
mammalian cells is carried out by a calcium phosphate method (Graham, F. L. and van 
der Eb, A. J., Virology, 52, 456-467 (1973)), a DEAE-dextran method (Sussman, D. J. 
and Milman, G, Mol. Cell BioL, 4, 1641-1643 (1984)), a lipofection method (Feigner, 
P. L. et al., Proc. Natl. Acad. Sci. USA, 84, 7413-7417 (1987)), an electric perforation 
method (Neumann, E., 'et al., EMBO J., 1, 841-845 (1982)), etc. The transformed 
cells prepared as such may, for example, be used for a method for measuring the drug 
transporting activity, analysis of the drug excretion mechanism or screening of 
compound which regulates the drug transporting capability. 
[0072] 

(Diagnosing Method and Diagnosing Kit) 

In a different embodiment of the present invention, there is provided a method 
for diagnosing a drug sensitivity of a subject by the detection of the polymorphisms of 

35 



JP-2002-175806 



filed on June 17, 2002 



the present invention or a kit therefor. The drug includes cancer chemotherapeutic 
drugs and it is clinically useful to diagnose the sensitivity therefor. For example, 
when a chemotherapeutic drug is administered to a specific patient suffering from 
cancer, responsiveness of the patient are different and there are big differences such as 
significantly effective, lowly effective and ineffective at all. This is because of a 
possibility that, since genetic background is different for each patient, activity of 
excreting the chemotherapeutic drug out of the cancer cell is greatly different. 
Accordingly, the diagnosis method of this embodiment is quite useful for deciding 
what type of chemotherapeutic drug or chemotherapeutic drug group is to be 
administered and/or for deciding the effective dose of chemotherapeutic drug or 
chemotherapeutic drug group. In a preferred embodiment, the subjects having 
polymorphisms shown in Table 3 are suggested to be sensitive to the indolocarbazoie 
compound represented by the formula (I). Accordingly, a therapy by an effective 
dose of the said compound can be applied to the patient suffering from cancer for 
which the above compound is effective and a significant improvement in therapeutic 
effect as well as a big reduction in side effect can be expected. 
[0073] 

In another embodiment of the present invention, there is provided a kit for 
diagnosing a drug sensitivity containing one or more of the polynucleotide, pair of 
primers, polypeptide, antibody and transformed cell of the present invention. The 
diagnosing kit may contain an appropriate package for a safe storage of the 
constituting reagents and a package insert for illustrating the method of the present 
invention. It may further contain an appropriate buffer, nucleotide, polymerase such 
as heat-resistant polymerase and fluorescent substance for the detection. 
[0074] 

(Computer System) 

In another embodiment of the present invention, there is provided a computer 
system where at least one SNP(s) of the ABCG2 gene or at least one polymorphic 
mutant polypeptide sequence(s) concerning the ABCG2 polypeptide is stored and 
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displayed. This computer system includes an input/output device, a central 
processing unit and a readable storage medium (memory) where the aforementioned 
polymorphic sequence data are stored. The above polymorphic sequence data include 
nucleotide sequence, genetic type and haplotype of the ABCG2 gene in a subject 
population or amino acid sequence, spots by two-dimensional electrophoresis, mass 
spectrometric data, etc. of the ABCG2 polypeptide. These data are processed by 
various programs and can be used for determination of genetic type, linkage 
disequilibrium analysis, etc. In a preferred embodiment, result of those analyses can 
be used for the prediction of drug sensitivity of mammalian cells. 
[0075] 
[Examples] 

The present invention is explained in more detail by reference to the following 
examples which are results of identification of single nucleotide polymorphisms of the 
ABCG2 gene using human genomic DNAs, and then preparation of cell lines 
expressing mutant ABCG2 polypeptides to analyze the function thereof. However, 
these examples do not restrict the scope of the present invention. 
[0076] 

[Example 1] Identification of SNPs in Human ABCG2 Gene 

The present inventors firstly extracted genomic DNAs from 30 human cancer 
cell lines and also from human clinical samples of 149 persons (whites) and identified 
the SNPs by sequencing the ABCG2 gene. 
[0077] 

The 30 cancer cell lines are A-427, DLD-1, NCI-H69, HeLa S3, PC-13, 
MKN-45, UM-UC-3, HCT116, PA-1, RT4, MKN1, SK-OV-3, MADH, KATOIII, 
U118, HS746, T24, MSTO-211H, OVCR3, Lul35, Lx-1, SCC25, Cal27, MKN-74, 
SCaBER, BxPC-3, Hela, J82, NCI-H 187 and ES-2. Genomic DNA was extracted 
from those cell lines with Trizol reagent (Gibco BRL). Human clinical samples w«re 
purchased from IMPATH-BCP Co. Nucleotide sequences of sixteen exons and 
peripheral introns of the ABCG2 gene were determined by direct sequencing. Firstly, 
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sixteen exons were amplified from genomic DNA by PCR (LA Taq Takara) using each 
pair of primers shown in Table 1. Next, amplified DNA fragments were treated with 
ExoSAP-IT (USB corporation) to digest remaining primers and to remove unwanted 
dNTPs. Then, the DNA fragments were applied to cycle sequencing reaction with 
dye terminator method (Dynamic ET Dye Terminator Cycle Sequencing Kit; 
Amersham) using sense primers shown in Table 2. After the removal of 
dye-terminator by G-50 gel filtration column, the nucleotide sequences and SNPs were 
determined by capillary sequencer MegaB ACE 1000 (Molecular Dynamic). The 
identified SNPs were reconfirmed by sequencing using antisense primers shown in 
Table 2. 
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[0078] 

Table 1 





Forward Primers 


neverse rrimers 


Exon 1 


S'-GTGCC C ACTC AAAAGGTT-3' 


S'-TCCAGTCAAAGCTGTACTCTG-S' 


Exon 2 


5-ATGTATTGTCACCTAGTGTTTG-3' 


5'-AAAGTGTGAAGCCTTGAGCAGA-3' 


Exon 3 


5-AACGGAGATGTTTCACAAGA -3' 


5-TACAATAAAGCCCCAAAACA -3' 


Exon 4 


5-GAGGAAAAAGAATGGGAGAA -3* 


S-GTCTGCAAAGCCTGCTATAA-S' 


Exon 5 


S'-TTCCTTCACCTTTC 1 1 1 1 CC-3' 


5*-CTTCCATAAAACTGGTCCCT-3* 


Exon 6 


5'-GAGGTGCTTTGTATCAGGCT-3' 


5'-GATCAGGCCAGTAGGTCAAC-3' 


Exon 7 


5'-CTTGTAAATACTTGCAGATTACCTG-3* 


5'-TGTTCAAGTGACAGAATAAATGGCT-3' 


Exon 8 


5-AAAGGGTAAAATTACGTGGG -3' 


5'-GCAAACAAACTGACG 1 1 1 IC-3' 


Exon 9 


0 — AA 1 UAAuu 1 Li 1 1 AUuuAAUU o 


5'-CTGGCTGACACTTCTTTCAC-3' 


Exon 10 


r-i Tr'TO^OP A A APP APAf^ATA APT-^' 


*i'-P ATTTAA AAATAATTGGGCCAGGTG-3' 

3 V^//A III /A/A /A /A /A 1 r\n 1 1 VJVi\JVi'vr»W I « wr 


Exon 1 1 


C» PTA ATTAnPTTPPAAAP,P,nP -1' 


^'-AAACCAGGCTGCTCTTTACT-S' 


Exon 1 2 


5'-GCTGGGTATTTTTCAAGGAT-3' 


S'-AGAGAGTG CAAAATGGACAG-3' 


Exon 1 3 


5 , -TGCCTGTAGCTCTTCATCTC-3 , 


S'-ACGAGAGGGAACCAAAATAG-S* 


Exon 1 4 


S-r/rTTTTGGCAGCTTTAAATGATAGC-S' 


5'- AATCTTTCTCCTTT ACT AG G AGGT A-3* 


Fvr\n 1 R 

exon i vi 


^'-TTTACTTCTTTTGTATTGGAAGCCA-S' 


5'-TAGAGGATAAATCGATTGATAGGGA-3' 


Exon 1 6 


5'-ATCTGAAGGGGTAATTATTAAAGGC-3' 


S'-TGTTCCAGAAATGGTGCAAGAATTC-S' 


[0079] 








Table 2 






Sense Primers 


Antisense Primers 


Exon 1 


5 , -GTGCCCACTCAAAAGGTT-3 , 


S'-CAAGAGTrTTTACCAACCCA-S' 


Exon 2 


5'-ATGTATTGTCACCTAGTGTTTG-3 


S'-GTGGCCCAATTATTTCACT-S* 


Exon 3 


S'-TAAGAGTTGGTTTGTGCTTG-S' 


S'-AACATGGTCAACTGCTACAT-S* 


Exon 4 


5-ATGTTTTGGGGCTTTATTG-3' 


S-TATTCCAGATTCTCCCTGC-S' 


Exon 5 


5-CAGGCTTTGCAGACATCTA-3 


5'-ATTGTTATGGAAAGCAACCA-3' 


Exon 6 


S'-GAGGTGCTTTGTATCAGGCT-S* 


S-CACCCTCATCACAGACATC-S* 


Exon 7 


S'-CTGTCCTAGAATCTGCATTT-S' 


5'-AGCTGGTGCTACAAAAAT-3 f 


Exon 8 


5-AAAGGGTAAAATTACGTGGG -3' 


5'-TCTGGTTGTTGCTTCCTACT-3' 


Exon 9 


5-GTTAGGGAAGCATCCAAGA-3' 


5'-AGGGAAGCTTTCCAAAAGTA-3' 


Exon 10 


5-TCTCCCCAAAGCACAGATAACT-3' 


5-T G GT GGTG G ATGTCTGT AGT-3' 


Exon 1 1 


5 ? -CTAATTACCTTCCAAAGGGC-3' 


5-GCTCAGGAI 1 1 ICTTCCCTA-3' 


Exon 12 


S'-CTGGACTGAGTGTTCAGGAG-S' 


5-AGAGAGTGCAAAATGGACAG-3' 


Exon 13 


5-TGCCTGTAGCTCTTCATCTC-3' 


5-ATAAGGGCAAAGAGGAAAGT-3' 


Exon 14 


S'-TTTGTTCTTCCTTTAAAACCG-S' 


S'-AATCTTTCTCCTTTACTAGGAGGTA-S* 


Exon 15 


5'-.jj7AGTTCTTTTGTATTGGAAGCCA-3' 5-AAAAGGCCCAAAACAATAAG-3* 



Exon 16-1 5-ATCTGAAGGGGTAATTATTAAAGGC-3' 5'-CAGGAGTTTCCAGAATTCAA-3' 
Exon 16-2 5'-TGTTGTTTTCTGTTCCCTTG-3' 5'-TGTTCCAGAAATGGTGCAAGAATTC-3' 
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[0080] 

Results of the identified SNPs in the 30 human cancer cell lines and in the 
human clinical samples for 149 persons (whites) on the basis of the aforementioned 
determination of nucleotide sequences are shown in Table 3. In the column for 
domain in Table 3, ABC means ATP binding cassette, EC means extra cellular region, 
TM means transmembrane region and UTR means untranslated region. For example, 
when a mutation site is shown by counting the first adenine in the translation initiation 
codon as the 1st one, G34A was found in five cell lines (16.7%) in 30 kinds of cancer 
cell lines and in 29 persons (19.5%) in human clinical samples of 149 persons. 
Incidentally, the mutation where 10th adenine from the 5'-side of intron 3 is substituted 
with guanine is shown as "A+10G" and the mutation where 21st cytosine from the 
3'-site of intron 13 is substituted with thymine is shown as "C-21T". Positions of 
some SNPs in Table 3 are shown in Fig. 1 together with a schematic structure of the 
ABCG2 polypeptide. Among those SNPs, G34A was present in a leader sequence 
which is important for localization of the ABCG2 polypeptide to a cell membrane and 
C421A was a mutation existing in an ATP binding cassette (ABC) region being 
important for binding to ATP which is transportation energy. Accordingly, those 
mutations have a high possibility of affecting the activity of the ABCG2 polypeptide. 
C376T is a mutation to termination codon existing in the ABC region and the fact that 
ABCG2 loses its activity is clear. Incidentally, each SNP for C496Q T623C, A1444G 
and G1445C is reported in NCBI SNP CLUSTER ED: rs 1061017, NCBI SNP 
CLUSTER ID: rsl061018, Cancer Res. 59, 8-13, 1999 and Proc. Natl. Acad. Sci. USA, 
95, 15665-15670, 1998, respectively but they were not detected in the aforementioned 
cell lines and human-derived samples. 
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[0081] 



Table 3 



olNJrS 


lnuuencc on 

Amino ActH 
^iih^Htntion 


Th y i cti n<r 
Jj,AlolllIg 

Position 


Doma i n 


Prpmiencv in 
30 Cell Lines 


Freouencv in 
149 Human 
Clinical 
Samples 




Vail ZlYlCL 


jZrXon z 


^pniipnpp 


Win 1%\ 


29 riQ 5%^ 


A + i on 




Tntrrvn *^ 




ND 


25 (16.8%) 


LjOV 1 


T\/i-1 0TT\/r 


Exon 4 


ARP 

ADV. 


0 


1 CO 7%) 


V-O / 0 1 


oimzo lerm 


xiAon *t 






o 


C421A 


Glnl41Lys 


Exon 5 


ABC 


6 (20%) 


24(16.1%) 


C458T 


Thrl53Met 


Exon 5 


ABC 


1 (3.3%) 


0 


C474T 


Asp 158 Asp 


Exon 5 


ABC 


0 


1 (0.7%) 


C496G 


Glnl66Glu 


Exon 5 


ABC 


0 


0 


T623C 


Phe208Ser 


Exon 6 


ABC 


0 


0 


A+20G 




Intron 11 




ND 


44 (29.5%) 


A1444G 


Arg482Gly 


Exon 12 


TM3 


0 


0 


G1445C 


Arg482Thr 


Exon 12 


TM3 


0 


0 


C-21T 




Intron 13 




ND 


36 (24.2%) 


A1768T 


Asp590Tyr 


Exon 15 


EC3 


0 


1 (0.7%) 


G2237T 




Exon 16 


3'UTR 


1 (3.3%) 


0 


G2393T 




Exon 16 


3'UTR 


1 (3.3%) 


0 



[0082] 

[Example 2] Preparation of Cell Lines Expressing mutated ABCG2 

Among the polymorphic mutations identified in Example 1, two mutations - 
G34A and C421A - having a high possibility to affect the function of the ABCG2 
polypeptide were prepared and introduced into animal cells as an endeavor to analyze 
their functions. Preparation of the mutated ABCG2 genes was conducted by PCR and 
a point mutation was introduced. After confirming the introduction of the target 
mutations by sequencing, the mutated genes were cloned into Hindlll and Xhol sites of 
an expression vector pcDNA3.1(+) and expression plasmid for each mutant was 
prepared. As a control, a plasmid expressing the wild type (WT) ABCG2 and the 
vector plasmid pcDNA3.1(+) alone without the ABCG2 gene were used and those four 
kinds of expression plasmids were introduced to an animal cell (porcine kidney cell 
line) LLC-PK1 by lipofection method (Lipofectamine; Gibco BRL). Stable 
transfectants were selected with 1,500 jig/ml of Geneticin (Gibco BRL) for two weeks 
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and cell lines were established. To determine the expressed level of ABCG2 in each 
cell line, total RNA was extracted from each transfectant cells and HeLa cells with 
Trizol (Gibco BRL). Seven microgram of the total RNA extracted from each clone of 
cell and full length ABCG2 cDNA (2.2 kb) probe labeled with 32 P were used for 
Northern hybridization. Several transfectants which expressed equal amount of 
ABCG2 mRNA were selected to eliminate the effect of expression level and the result 
is shown in Fig. 2. It is noted that, in Fig. 2, each of the transfectant cells of lanes 
T 4 expressed ABCG2 mRNA in equal amount. Incidentally, GAPDH was used as an 
internal standard for mRNA expressed in each cell. It was noted that the lane 5 was a 
control clone which was transfected by vector alone and no ABCG2 mRNA was 
expressed. 
[0083] 

[Example 3] Evaluation of Resistance to Compound B 

The transfectant cells which were selected in Example 2 and in which nearly 
equal amount of ABCG2 mRNA was expressed were incubated (cultivated) in a 199 
medium containing 1 mM of L-glutamine, 50 units/ml of penicillin, 50 mg/ml of 
streptomycin and 10% by volume of fetal bovine serum. All of the incubations were 
carried out at 37°C under the humidified atmosphere containing 5% of carbon dioxide. 
The cytotoxicity of anticancer drugs was determined by sulforhodamine B dye-staining 
method and compared with each other. Specifically, four kinds of transformed cell 
clones were cultured at 37°C for 72 hours in a medium containing Compounds B or 
camptothecin of various concentrations, then fixed with trichloroacetic acid and 
stained for 30 minutes with 0.4% sulforhodamine B dissolved in 1% acetic acid 
solution. After unbound dye was removed by four washes with 1% acetic acid, 
polypeptide-bound dye was extracted with 10 mM unbuffered Tris base. Then, 
optical density of the extract was measured in a plate reader at 564 nm and 50% 
inhibitory concentration (IC 50 ) values for cell viability were determined. The results 
are shown in Table 4 . In the cell line (1-58) expressing the wild type ABCG2, 
resistance to Compound B has been increased to an extent of 400-fold or more as 
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compared with the cell line (C4) which was transfected with vector alone. On the 
contrary, the resistance of the cell line (2-51) having a mutation of Vall2Met in the 
leader sequence of ABCG2 or the cell line (3-28) having a mutation of Glnl41Lys in 
the ABC region to Compound B increased to an extent of 7.7-fold and 48.2-fold, 
respectively, as compared with C4 ,however, the resistance as compared with that of 
wild type was about 1/10 or less. To camptothecin which is not a substrate for 
ABCG2, there was no significant difference in terms of the resistance among the cells. 
From these results, it was suggested that, the two kinds of mutant ABCG2 (Vall2Met 
and Glnl41Lys) obviously have a decreased capability of excreting Compound B 
which is a topoisomerase inhibitor out of the cell as compared with wild type ABCG2. 
[0084] 



Table 4 Influence of SNPs on Sensitivity to Compound B 



Clone No. 


C4 


1-58 


2-51 


3-28 


SNP Site 




Wild Type 


Leader Sequence 
(Vall2Met) 


ABC Domain 
(Glnl41Lys) 


Expression Level* 


0 


1.90 


2.08 


1.95 


IC 50 (uM Camptothecin) 


0.0087 


0.0213 


0.0122 


0.0268 


IC 50 (uM Compound B) 


0.122 


>50 


0.94 


5.88 


Increased Rate 


1.0 


>409 


7.7 


48.2 



*: Expression level of ABCG2 of each cell line was standardized to HeLa cell (= 1.0). 



[0085] 

[Effect of the Invention] 

By using the method of the present invention, a drug transport capability of a 
mammalian cell can be predicted whereby sensitivity of a patient to various drugs such 
as anti-cancer drugs can be diagnosed and an indicator for the therapy can be obtained. 
In other words, as a result of selecting an anti-cancer drug in cancer, therapy and, 
particularly, detecting a cancer cell(s) which is highly sensitive to indolocarbazole 
compounds, it is now possible to selectively apply the said compounds for the therapy. 
In addition, the optimum dose of the indolocarbazole compounds in the cancer therapy 
is found and, at the same time, side effect of the compounds is reduced whereby a 
highly effective method of using the indolocarbazole compounds is provided. 
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[0086] 

[Sequence Listing] 

SEQUENCE LISTING 
<110> BANYU PHARMACEUTICAL CO., LTD 

<120> A method for predicting a drug transporting capability 
by ABCG2 polymorphisms 

<130> P7194GN 

<140> 
<141> 

<160> 68 

<170> Patent In Ver. 2. 1 

<210> 1 

<211> 1968 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (1).. (1965) 

<400> 1 

atg tct tec agt aat gtc gaa gtt ttt ate cca gtg tea caa gga aac 48 

Met Ser Ser Ser Asn Val Glu Val Phe Me Pro Val Ser Gin Gly Asn 
1 5 10 15 

acc aat ggc ttc ccc gcg aca get tec aat gac ctg aag gca ttt act 96 
Thr Asn Gly Phe Pro Ala Thr Ala Ser Asn Asp Leu Lys Ala Phe Thr 
20 25 30 

gaa gga get gtg tta agt ttt cat aac ate tgc tat cga gta aaa ctg 144 
Glu Gly Ala Val Leu Ser Phe His Asn Me Cys Tyr Arg Val Lys Leu 
35 40 45 
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aag agt ggc ttt eta cct tgt cga aaa cca gtt gag aaa gaa ata tta 192 

Lys Ser Gly Phe Leu Pro Cys Arg Lys Pro Val Glu Lys Glu lie Leu 
50 55 60 

teg aat ate aat ggg ate atg aaa cct ggt etc aac gec ate ctg gga 240 

Ser Asn He Asn Gly He Met Lys Pro Gly Leu Asn Ala lie Leu Gly 
65 70 75 80 



ccc aca ggt gga ggc aaa tct teg tta tta gat gtc tta get gca agg 
Pro Thr Gly Gly Gly Lys Ser Ser Leu Leu Asp Val Leu Ala Ala Arg 
85 90 95 



288 



aaa gat cca agt gga tta tct gga gat gtt ctg ata aat gga gca ccg 
Lys Asp Pro Ser Gly Leu Ser Gly Asp Val Leu lie Asn Gly Ala Pro 
100 105 110 



336 



cga cct gee aat ttc aaa tgt aat tea ggt tac gtg gta caa gat gat 384 
Arg Pro Ala Asn Phe Lys Cys Asn Ser Gly Tyr Val Val Gin Asp Asp 
115 120 125 

gtt gtg atg ggc act ctg acg gtg aga gaa aac tta cag ttc tea gca 432 
Val Val Met Gly Thr Leu Thr Val Arg Glu Asn Leu Gin Phe Ser Ala 
130 135 140 



get ctt egg ctt gca aca act atg acg aat cat gaa aaa aac gaa egg 
Ala Leu Arg Leu Ala Thr Thr Met Thr Asn His Glu Lys Asn Glu Arg 
145 150 155 160 



480 



att aac agg gtc att caa gag tta ggt ctg gat aaa gtg gca gac tec 
lie Asn Arg Val He Gin Glu Leu Gly Leu Asp Lys Val Ala Asp Ser 
165 170 175 



528 



aag gtt gga act cag ttt ate cgt ggt gtg tct gga gga gaa aga aaa 
Lys Val Gly Thr Gin Phe lie Arg Gly Val Ser Gly Gly Glu Arg Lys 
180 185 190 



576 



agg act agt ata gga atg gag ctt ate act gat cct tec ate ttg ttc 
Arg Thr Ser He Gly Met Glu Leu lie Thr Asp Pro Ser lie Leu Phe 
195 200 205 



624 
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ttg gat gag cct aca act ggc tta gac tea age aca gca aat get gtc 672 
Leu Asp Glu Pro Thr Thr Gly Leu Asp Ser Ser Thr Ala Asn Ala Val 
210 215 220 

ctt ttg etc ctg aaa agg atg tct aag cag gga cga aca ate ate ttc 720 
Leu Leu Leu Leu Lys Arg Met Ser Lys Gin Gly Arg Thr He He Phe 
225 230 235 240 

tec att cat cag cct cga tat tec ate ttc aag ttg ttt gat age etc 768 
Ser He His Gin Pro Arg Tyr Ser lie Phe Lys Leu Phe Asp Ser Leu 
245 250 255 

acc tta ttg gee tea gga aga ctt atg ttc cac ggg cct get cag gag 816 
Thr Leu Leu Ala Ser Gly Arg Leu Met Phe His Gly Pro Ala Gin Glu 
260 265 270 

gee ttg gga tac ttt gaa tea get ggt tat cac tgt gag gee tat aat 864 
Ala Leu Gly Tyr Phe Glu Ser Ala Gly Tyr His Cys Glu Ala Tyr Asn 
275 280 285 

aac cct gca gac ttc ttc ttg gac ate att aat gga gat tec act get 912 
Asn Pro Ala Asp Phe Phe Leu Asp lie He Asn Gly Asp Ser Thr Ala 
290 295 300 

gtg gca tta aac aga gaa gaa gac ttt aaa gee aca gag ate ata gag 960 
Val Ala Leu Asn Arg Glu Glu Asp Phe Lys Ala Thr Glu He lie Glu 
305 310 315 320 

cct tec aag cag gat aag cca etc ata gaa aaa tta gcg gag att tat 1008 
Pro Ser Lys Gin Asp Lys Pro Leu lie Glu Lys Leu Ala Glu He Tyr 
325 330 335 

gtc aac tec tec ttc tac aaa gag aca aaa get gaa tta cat caa ctt 1056 
Val Asn Ser Ser Phe Tyr Lys Glu Thr Lys Ala Glu Leu His Gin Leu 
340 345 350 

tec ggg ggt gag aag aag aag aag ate aca gtc ttc aag gag ate age 1104 
Ser Gly Gly Glu Lys Lys Lys Lys. lie Thr Val Phe Lys Glu lie Ser 
355 360 365 
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tac acq acc tec ttc tgt cat caa etc aga tgg gtt tec aag cgt tea 
Tyr Thr Thr Ser Phe Cys His Gin Leu Arg Trp Val Ser Lys Arg Ser 
370 375 380 



1152 



ttc aaa aac ttg ctg ggt aat ccc cag gee tct ata get cag ate att 
Phe Lys Asn Leu Leu Gly Asn Pro Gin Ala Ser Me Ala Gin Me Me 
385 390 395 400 



1200 



gtc aca gtc gta ctg gga ctg gtt ata ggt gee att tac ttt ggg eta 
Val Thr Val Val Leu Gly Leu Val Me Gly Ala lie Tyr Phe Gly Leu 
405 410 415 



1248 



aaa aat gat tct act gga ate cag aac aga get ggg gtt etc ttc ttc 
Lys Asn Asp Ser Thr Gly lie Gin Asn Arg Ala Gly Val Leu Phe Phe 
420 425 430 



1296 



ctg acg acc aac cag tgt ttc age agt gtt tea gee gtg gaa etc ttt 
Leu Thr Thr Asn Gin Cys Phe Ser Ser Val Ser Ala Val Glu Leu Phe 
435 440 445 



1344 



gtg gta gag aag aag etc ttc ata cat gaa tac ate age gga tac tac 1392 

Val Val Glu Lys Lys Leu Phe Me His Glu Tyr Me Ser Gly Tyr Tyr 

450 455 460 

aga gtg tea tct tat ttc ctt gga aaa ctg tta tct gat tta tta ccc 1440 

Arg Val Ser Ser Tyr Phe Leu Gly Lys Leu Leu Ser Asp Leu Leu Pro 

465 470 475 480 



atg agg atg tta cca agt att ata ttt acc tgt ata gtg tac ttc atg 
Met Arg Met Leu Pro Ser Me Me Phe Thr Cys Me Val Tyr Phe Met 
485 490 495 



1488 



tta gga ttg aag cca aag gca gat gee ttc ttc gtt atg atg ttt acc 1536 

Leu Gly Leu Lys Pro Lys Ala Asp Ala Phe Phe Val Met Met Phe Thr 
500 505 510 

ctt atg atg gtg get tat tea gee agt tec atg gca ctg gee ata gca 1584 

Leu Met Met Val Ala Tyr Ser Ala Ser Ser Met Ala Leu Ala Me Ala 
515 520 525 
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gca ggt cag agt gtg gtt tct gta gca aca ctt etc atg acc ate tgt 1632 
Ala Gly Gin Ser Val Val Ser Val Ala Thr Leu Leu Met Thr Me Cys 
530 535 540 

ttt gtg ttt atg atg att ttt tea ggt ctg ttg gtc aat etc aca acc 1680 
Phe Val Phe Met Met Me Phe Ser Gly Leu Leu Val Asn Leu Thr Thr 
545 550 555 560 

att gca tct tgg ctg tea tgg ctt cag tac ttc age att cca cga tat 1728 
Me Ala Ser Trp Leu Ser Trp Leu Gin Tyr Phe Ser Me Pro Arg Tyr 
565 570 575 

gga ttt acg get ttg cag cat aat gaa ttt ttg gga caa aac ttc tgc 1776 
Gly Phe Thr Ala Leu Gin His Asn Glu Phe Leu Gly Gin Asn Phe Cys 
580 585 590 

cca gga etc aat gca aca gga aac aat cct tgt aac tat gca aca tgt 1824 
Pro Gly Leu Asn Ala Thr Gly Asn Asn Pro Cys Asn Tyr Ala Thr Cys 
595 600 605 

act ggc gaa gaa tat ttg gta aag cag ggc ate gat etc tea ccc tgg 1872 
Thr Gly Glu Glu Tyr Leu Val Lys Gin Gly lie Asp Leu Ser Pro Trp 
610 615 620 

ggc ttg tgg aag aat cac gtg gec ttg get tgt atg att gtt att ttc 1920 
Gly Leu Trp Lys Asn His Val Ala Leu Ala Cys Met Me Val Me Phe 
625 630 635 640 

etc aca att gee tac ctg aaa ttg tta ttt ctt aaa aaa tat tct taa 1968 
Leu Thr Me Ala Tyr Leu Lys Leu Leu Phe Leu Lys Lys Tyr Ser 
645 650 655 



<210> 2 
<211> 655 
<212> PRT 
<213> Homo sapiens 

<400> 2 

Met Ser Ser Ser Asn Val Glu Val Phe Me Pro Val Ser Gin Gly Asn 
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1 



10 



15 



Thr Asn Gly Phe Pro Ala Thr Ala Ser Asn Asp Leu Lys Ala Phe Thr 
20 25 30 

Glu Gly Ala Val Leu Ser Phe His Asn lie Cys Tyr Arg Val Lys Leu 
35 40 45 

Lys Ser Gly Phe Leu Pro Cys Arg Lys Pro Val Glu Lys Glu Me Leu 
50 55 60 

Ser Asn lie Asn Gly Me Met Lys Pro Gly Leu Asn Ala Me Leu Gly 
65 70 75 80 

Pro Thr Gly Gly Gly Lys Ser Ser Leu Leu Asp Val Leu Ala Ala Arg 
85 90 95 

Lys Asp Pro Ser Gly Leu Ser Gly Asp Val Leu Me Asn Gly Ala Pro 
100 105 110 

Arg Pro Ala Asn Phe Lys Cys Asn Ser Gly Tyr Val Val Gin Asp Asp 
115 120 125 

Val Val Met Gly Thr Leu Thr Val Arg Glu Asn Leu Gin Phe Ser Ala 
130 135 140 

Ala Leu Arg Leu Ala Thr Thr Met Thr Asn His Glu Lys Asn Glu Arg 
145 150 155 160 

Me Asn Arg Val Me Gin Glu Leu Gly Leu Asp Lys Val Ala Asp Ser 
165 170 175 

Lys Val Gly Thr Gin Phe Me Arg Gly Val Ser Gly Gly Glu Arg Lys 
180 185 190 

Arg Thr Ser Me Gly Met Glu Leu Me Thr Asp Pro Ser Me Leu Phe 
195 200 205 



Leu Asp Glu Pro Thr Thr Gly Leu Asp Ser Ser Thr Ala Asn Ala Val 
210 215 220 
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Leu Leu Leu Leu Lys Arg Met Ser Lys Gin Gly Arg Thr lie lie Phe 
225 230 235 240 

Ser lie His Gin Pro Arg Tyr Ser Me Phe Lys Leu Phe Asp Ser Leu 
245 250 255 

Thr Leu Leu Ala Ser Gly Arg Leu Met Phe His Gly Pro Ala Gin Glu 
260 265 270 

Ala Leu Gly Tyr Phe Glu Ser Ala Gly Tyr His Cys Glu Ala Tyr Asn 
275 280 285 

Asn Pro Ala Asp Phe Phe Leu Asp Me Me Asn Gly Asp Ser Thr Ala 
290 295 300 

Val Ala Leu Asn Arg Glu Glu Asp Phe Lys Ala Thr Glu Me lie Glu 
305 310 315 320 

Pro Ser Lys Gin Asp Lys Pro Leu Me Glu Lys Leu Ala Glu Me Tyr 
325 330 335 

Val Asn Ser Ser Phe Tyr Lys Glu Thr Lys Ala Glu Leu His Gin Leu 
340 345 350 

Ser Gly Gly Glu Lys Lys Lys Lys Me Thr Val Phe Lys Glu Me Ser 
355 360 365 

Tyr Thr Thr Ser Phe Cys His Gin Leu Arg Trp Val Ser Lys Arg Ser 
370 375 380 

Phe Lys Asn Leu Leu Gly Asn Pro Gin Ala Ser Me Ala Gin Me Me 
385 390 395 400 

Val Thr Val Val Leu Gly Leu Val Me Gly Ala Me Tyr Phe Gly Leu 
405 410 415 

Lys Asn Asp Ser Thr Gly lie Gin Asn Arg Ala Gly Val Leu Phe Phe 
420 425 430 
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Leu Thr Thr Asn Gin Cys Phe Ser Ser.Val Ser Ala Val Glu Leu Phe 
435 440 445 

Val Val Glu Lys Lys Leu Phe lie His Glu Tyr lie Ser Gly Tyr Tyr 
450 455 460 

Arg Val Ser Ser Tyr Phe Leu Gly Lys Leu Leu Ser Asp Leu Leu Pro 
465 470 475 480 

Met Arg Met Leu Pro Ser lie lie Phe Thr Cys 1 1 e Va I Tyr Phe Met 
485 490 495 

Leu Gly Leu Lys Pro Lys Ala Asp Ala Phe Phe Val Met Met Phe Thr 
500 505 510 

Leu Met Met Val Ala Tyr Ser Ala Ser Ser Met Ala Leu Ala lie Ala 
515 520 525 

Ala Gly Gin Ser Val Val Ser Val Ala Thr Leu Leu Met Thr Me Cys 
530 535 540 

Phe Val Phe Met Met Me Phe Ser Gly Leu Leu Val Asn Leu Thr Thr 
545 550 555 560 

lie Ala Ser Trp Leu Ser Trp Leu Gin Tyr Phe Ser lie Pro Arg Tyr 
565 570 575 

Gly Phe Thr Ala Leu Gin His Asn Glu Phe Leu Gly Gin Asn Phe Cys 
580 585 590 

Pro Gly Leu Asn Ala Thr Gly Asn Asn Pro Cys Asn Tyr Ala Thr Cys 
595 600 605 

Thr Gly Glu Glu Tyr Leu Val Lys Gin Gly Me Asp Leu Ser Pro Trp 
610 615 620 

Gly Leu Trp Lys Asn His Val Ala Leu Ala Cys Met Me Val Me Phe 
625 . 630 635 640 

Leu Thr Me Ala Tyr Leu Lys Leu Leu Phe Leu Lys Lys Tyr Ser 
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645 



650 



655 



<210> 3 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 1 forward 
primer 

<400> 3 

gtgcccactc aaaaggtt 



<210> 4 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :Exon 1 reverse 
primer 

<400> 4 

tccagtcaaa gctgtactct g 



<210> 5 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 2 forward 
primer 

<400> 5 
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atgtattgtc acctagtgtt tg 



<210> 6 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 2 reverse 
primer 

<400> 6 

aaagtgtgaa gccttgagca ga 



<210> 7 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 3 forward 
primer 

<400> 7 

aacggagatg tttcacaaga 



<210> 8 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 3 reverse 
primer 

<400> 8 

tacaataaag ccccaaaaca 
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<210> 9 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 4 forward 
primer 

<400> 9 

gaggaaaaag aatgggagaa 



<210> 10 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 4 reverse 
primer 

<400> 10 

gtctgcaaag cctgctataa 



<210> 11 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 5 forward 
primer 

<400> 11 

ttccttcacc tttcttttcc 
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<210> 12 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :Exon 5 reverse 
primer 

<400> 12 

cttccataaa actggtccct 20 



<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 6 forward 
primer 

<400> 13 

gaggtgcttt gtatcaggct 20 



<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 6 reverse 
primer 

<400> 14 

gatcaggcca gtaggtcaac 20 
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<210> 15 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SequencerExon 7 forward 
primer 

<400> 15 

cttgtaaata cttgcagatt acctg 



<210> 16 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 7 reverse 
primer 

<400> 16 

tgttcaagtg acagaataaa tggct 



<210> 17 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 8 forward 
primer 

<400> 17 

aaagggtaaa attacgtggg 



<210> 18 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 8 reverse 
primer 

<400> 18 

gcaaacaaac tgacgttttc 20 



<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 9 forward 
primer 

<400> 19 

aatgaaggtg ttagggaagc 20 



<210> 20 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 9 reverse 
primer 

<400> 20 

ctggctgaca cttctttcac 20 



<210> 21 
<211> 22 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 10 forward 
primer 

<400> 21 

tctccccaaa gcacagataa ct 



<210> 22 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 10 reverse 
primer 

<400> 22 

catttaaaaa taattgggcc aggtg 



<210> 23 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 11 forward 
primer 

<400> 23 

ctaattacct tccaaagggc 



<210> 24 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 11 reverse 
primer 

<400> 24 

aaaccaggct gctctttact 20 



<210> 25 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 12 forward 
primer 

<400> 25 

gctgggtatt tttcaaggat 20 



<210> 26 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SequencerExon 12 reverse 
primer 

<400> 26 

agagagtgca aaatggacag 20 



<210> 27 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence:Exon 13 forward 
primer 

<400> 27 

tgcctgtagc tcttcatctc 



28 
20 
DNA 

Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :Exon 13 reverse 
primer 

<400> 28 

acgagaggga accaaaatag 



<210> 29 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 14 forward 
primer 

<400> 29 

ctttttggca gctttaaatg atagc 



<210> 30 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence :Exon 14 reverse 
primer 

<400> 30 

aatctttctc ctttactagg aggta 25 



<210> 31 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 15 forward 
primer 

<400> 31 

tttacttctt ttgtattgga agcca 25 



<210> 32 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 15 reverse 
primer 

<400> 32 

tagaggataa atcgattgat aggga 25 



<210> 33 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial Sequence:Exon 16 forward 
primer 

<400> 33 

atctgaaggg gtaattatta aaggc 25 



<210> 34 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 16 reverse 
primer 

<400> 34 

tgttccagaa atggtgcaag aattc 25 



<210> 35 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 1 sense 
primer 

<400> 35 

gtgcccactc aaaaggtt 18 



<210> 36 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : Exon 1 
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antisense primer 
<400> 36 

caagagtttt taccaaccca 

<210> 37 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 2 sense 
primer 

<400> 37 

atgtattgtc acctagtgtt tg 



<210> 38 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 2 
antisense primer 

<400> 38 

gtggcccaat tatttcact 



<210> 39 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 3 sense 
primer 
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<400> 39 

taagagttgg tttgtgcttg 



20 



<210> 40 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 3 
antisense primer 

<400> 40 

aacatggtca actgctacat 20 



<210> 41 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 4 sense 
primer 



<210> 42 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 4 
antisense primer 



<400> 41 

atgttttggg gctttattg 



19 
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<400> 42 

tattccagat tctccctgc 



<210> 43 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 5 sense 
primer 

<400> 43 

caggctttgc agacatcta 



<210> 44 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 5 
antisense primer 

<400> 44 

attgttatgg aaagcaacca 

<210> 45 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 6 sense 
primer 

<400> 45 
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gaggtgcttt gtatcaggct 



<210> 46 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 6 
antisense primer 

<400> 46 

caccctcatc acagacatc 



<210> 47 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 7 sense 
primer 

<400> 47 

ctgtcctaga atctgcattt 



<210> 48 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 7 
antisense primer 

<400> 48 

agctggtgct acaaaaat 
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<210> 49 
- <211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial SequencerExon 8 sense 
primer 

<400> 49 

aaagggtaaa attacgtggg 20 



<210> 50 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 8 
antisense primer 

<400> 50 

tctggttgtt gcttcctact 20 



<210> 51 
<211> 19 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence.Exon 9 sense 
primer 

<400> 51 

gttagggaag catccaaga 19 
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<210> 52 
<211> 20 
<212> DNA 

<213> Artificial Seauence 
<220> 

<223> Description of Artificial Sequence:Exon 9 
antisense primer 

<400> 52 

agggaagctt tccaaaagta 



<210> 53 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Exon 10 sense 
primer 

<400> 53 

tctccccaaa gcacagataa ct 



<210> 54 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 10 
antisense primer 

<400> 54 

tggtggtgga tgtctgtagt 
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<210> 55 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 11 sense 
primer 

<400> 55 

ctaattacct tccaaagggc 



<210> 56 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 11 
antisense primer 

<400> 56 

gctcaggatt ttcttcccta 



<210> 57 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 12 sense 
primer 

<400> 57 

ctggactgag tgttcaggag 



<210> 58 
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<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 12 
antisense primer 

<400> 58 

agagagtgca aaatggacag 

<210> 59 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 13 sense 
primer 

<400> 59 

tgcctgtagc tcttcatctc 



<210> 60 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence.Exon 13 
antisense primer 

<400> 60 

ataagggcaa agaggaaagt 



<210> 61 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 14 sense 
primer 

<400> 61 

tttgttcttc ctttaaaacc g 



<210> 62 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 14 
antisense primer 

<400> 62 

aatctttctc ctttactagg aggta 



<210> 63 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 15 sense 
primer 

<400> 63 

tttacttctt ttgtattgga agcca 



<210> 64 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :Exon 15 
antisense primer 

<400> 64 

aaaaggccca aaacaataag 20 



<210> 65 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 1 6— T sense 
primer 

<400> 65 

atctgaaggg gtaattatta aaggc 25 



<210> 66 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 16-1 
antisense primer 

<400> 66 

caggagtttc cagaattcaa 20 



<210> 67 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Description of Artificial Sequence:Exon 16-2 sense 
primer 

<400> 67 

tgttgttttc tgttcccttg 20 



<210> 68 
<211> 25 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:Exon 16-2 
antisense primer 

<400> 68 

tgttccagaa atggtgcaag aattc 25 
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[Brief Description of the Drawings] 
[Fig.l] 

Fig. 1 is a schematic diagram of the ABCG2 polypeptide showing the 
positions of single nucleotide polymorphisms of the present invention. 
[Fig-2] 

Fig. 2 is a result of Northern blot analysis by which the amount of the ABCG2 
mRNA in various transformed cells was determined. 
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